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(54) Abstract Title 

Compression decoding and re-encoding 



(57) An auxiliary data signal is derived from a received compression encoded audio signal during decoding 
of the signal and communicated with the decoded audio signal for use in re-encoding the signet. The auxiliary 
data signal may be communicated integrally with the audio signal, for example in the least significant bits, or 
may follow a different path. The provision of the auxiliary data signal may enable more transparent decoding 
and re-coding processes to talce place, by allowing the coding decisions to match the coding decisions 
onginally used; this can alleviate problems with quality reduction arising from cascaded decoding and 
re-coding processes. 

Application is to digital broadcasting in the studio environment with mixing, fading, etc. 
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At least one drawing originally filed was informal and the print reproduced iiere is talwn from a later filed formal copy. 
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AUDIO COMPRESSION 

This invention relates to compressed, that is to say data-reduced or 
bit-rate reduced, digital audio signals. 

The invention is applicable to a wide range of digital audio 
compression techniques; an Important example is the so-called 'MPEG 
5 Audio" coding, defined in ISO/IEC standards IS 1 1172-3 and IS 13818-3. 

In digital broadcasting, certain operations can be performed only on 
decoded audio signals. There is accordingly a requirement for compression 
decoding and re-encoding in the studio environment It is of course 
desirable that these cascaded decoding and re-encoding processes should 
10 Involve minimal reduction in quality. Studio operations such as mixing may 
be conducted on a digital PCM signal, although sometimes there will be a 
requirement for conversion of the PCM signal to analogue form. In the 
discussions that follow, attention will be focused on the use of a decoded 
audio signal in PCM format although It should be remembered that the 
15 Invention also encompasses the use of decoded analogue signals in 
analogue form. It will further be appreciated that whilst the digital 
broadcasting studio environment conveniently exemplifies the present 
invention, the Invention Is applicable to other uses of compressed audio 
signals. 

20 It is an object of the present Invention, in one aspect, to provide 

Improved digital audio signal processing which enables re-encoding of a 
compression decoded audio signal with minimal reduction in quality 

Accordingly, the present invention consists in one aspect In a method 
of audio signal processing, comprising the steps of receiving a compression 

25 encoded audio signal; compression decoding the encoded audio signal; 

deriving an auxiliary data signal; communicating the auxiliary data signal with 
the decoded eujdio signal and re-encoding the decoded audio signal utilising 
information from the auxiliary data signal. 

Preferably, the auxiliary data signal comprises essentially the encoded 

30 audio signal. 
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In one form of the invention, the auxiliary data signal is combined 
with the decoded audio signal for communication along the same signal path 
as the decoded audio signal. 

The Invention will now be described by way of example with reference 
to the accompanying drawings In whlch:- 

Rg. 1 is a block diagram of a digital broadcasting studio installation 
utilising an embodiment of the present invention; and 

Rg. 2 is a block diagram of similar form illustrating a second 
embodiment of the present invention. 

Refemng to Rg. 1, a coded audio bit-stream enters the decoder (D1) 
at the top left and the decoder produces a linear PCM audio signal, typically 
In the fomi of an ITU-R Rec. 647 CAES/EBU") bitstream, although other 
forms of PCM signal may be used. The PCM signal Is connected to the 
studio equipment (S) which may provkie such facilities as feding. mixing or 
switching. This connection is made via an Insertion unit PO which combines 
the auxiliary data signal with the PCM audio signal. Other audio sources are 
connected to the studio equipment; these are In the form of PCM signals, 
but some or all of them may previously have been coded, and those 
decoded locally may be accompanied by auxiliary data signals (e.g. the 
PCM signal from Decoder D2). The output of the studio equipment Is 
applied to the Input of the coder (C) via a signal splitter unit (Y) which 
separates the auxiliary data from the PCM signal. The output of the coder is 
a coded (i.e. digitally compressed) audio signal, in Fig. 1, the PCM signal 
path is represented by the solid line connecting the decoder and coder via 
the studio equipment If just a PCM signal arrives at the coder (i.e. the 
auxiliary data signal Is not present) the latter has to perfbmn an Independent 
re-codIng process. This Introduces Impairments in the form of coding 
artifacts into the signal (In the case of a PCM signal which has previously 
been coded, but without the auxiliary signal, these artifacts are additional to 
those present as the result of the earlier coding). 

In the example of an MPEG audio signal, the most important 
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information to carry with the signal is the positions of the coded audio frame 
boundaries. These frames are 24ms long when tlie sampling frequency Is 
48 kHz. 

The build up of impairments can be completely eliminated by avoiding 
5 decoding and re-coding wherever possible. For example, if enough of the 
original coded audio signal Is conveyed to the coder, as the auxiliary data 
signal, the coded audio signal can be reconstituted and substituted for the 
decoded and re-coded signal. This would require that the studio equipment 
pass the PCM signal transparently, and that the coded bitstreams to be 

10 switched or mixed are frame aligned, or can be brought into frame 

alignment Frame aligning can give rise to problems with audio/visual 
synchronisation ("lip sync') in applications such as television where video Is 
associated with the audio. 

Alternatively, if the auxiliary data signal indicates to the coder the 

15 positions In the PCM bitstream of the frame boundaries of the original coded 
signal, it Is possible to minimise any impairment introduced on re-coding if 
the original groups of audio samples which formed blocks of coded data 
(e.g. subband filter blocks or blocks of samples with the same scalefactor) 
are kept together to fomi equivalent blocks in the re-coded signal. Tills 

20 does not require frame alignment of coded bitstreams within the studio area, 
but it does require alignment of the appropriate data blocks within tiie 
bitstreams. Such alignment can be effected by the introduction of relatively 
short delays, which do not significantiy affect audio/video synchronisation. 
Furtiier reductions In the impalnment on re-codIng may be made if 

25 infomnation on the quantisation of ttie audio In the coded bltsti-eam Is 
conveyed to the coder (C). 

A furttier possibility is to move fiame boundaries In the Incoming 
coded bitstreams, whilst presen/ing the original blocks of coded data, to 
bring the frames closer to alignment. Relatively short delays can then be 

30 used to effect frame alignment by "fine tuning" the timing of tiie signals. 

Frame aligning the coded bitstreams in tills way, at a point where tiie entire 
incoming coded audio signal is available will minimize further impairment of 
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the audio, and re-coding will take place with the repositioned frame 
boundaries. 

If the frame boundaries are repositioned in such a way as to preserve 
the original block of samples with the same scale factor, only a partial 
decoding operation is needed. This technique is partlculariy suited to the 
editing of bit-rate reduced digital signals because full decoding and re- 
encoding can be eliminated. 

In the case where the studio is receiving MPEG audio coded signals 
in the forni of packetised elementary streams (PES), buffer stores In the 
decoders are used to ensure that the audio signals are conrectly timed to a 
local clock and appropriate) to associated video signals, using a 
programme clock reference (PCR) and presentation time stamps (PTS) 
within signals. The relatively small adjustments to signal timing needed to 
align blocks within coded bitstreams entering the studio with the blocks 
formed by the re-encodIng process in the coder (C) may be made either by 
making some adjustment to the timing In the decoders (D1, D2 etc.) or by 
introducing delays Into the PCM signal paths. 

In the arrangement of Fig. 1 . the auxiliary data takes the same path 
as the PCM signal through the studio equipment, and Is combined with the 
PCM audio m such a way that It has the minimal effect upon the audio. It is 
routed with the audio, and If the path Is not transparent (e.g. because of 
fading or mbdng) the modification of the auxiliary signal Is detected In the 
coder, and re-coding of the audio proceeds Independently of the auxiliary 
signal, if the path is transparent, the unmodified auxiliary signal facilitates 
the substitution of the re-coded PCM signal by the original coded signal, or 
re-codIng with the data blocks of the re-coded signal reproducing the blocks 
of the original signal as closely as possible, as described above. The dotted 
line of Rg. 1 . represents the path taken by the auxiliary data. 

Any modification of the signal and associated auxiliary data is 
detected by appropriate examination of the auxiliary data. For example, the 
auxiliary data may be accompanied by en-or-detecting cyclic redundancy 
check bits associated with the auxiliary data for each coded audio frame. 

3/7/2005. EAST Version: 2.0.1.4 



Audio signals whicli have not previously been coded will not be 
accompanied by any auxiliary data and will be impaired by the coding 
artifacts introduced by first-time coding when coded by the coder (C). 
Signals which have previously been coded but for which no auxiliary data is 
available will be impaired by additional coding artifacts when re-coded by 
the coder (C). 

Referring to Rg. 2, the PCi^ audio signal takes the same path 
through the studio equipment from the decoder (D1) to the coder (C) via the 
studio equipment (S). However, in this arrangement, the auxiliary data 
signal is not combined with the PCM audio but is routed separately. This 
arrangement has the advantage that the auxiliary data is not combined with 
the PCM audio, and there is no risk of audible changes to the signal as a 
result. This might be important, for example, if the studio equipment has 
only a limited resolution in terms of the audio sample word-length. 
Furthermore, the auxiliary data is not modified by ^ding or mixing. There 
are disadvantages in that the auxiliary signal needs to be delayed to keep it 
time-aligned with the PCM audio passing through the studio equipment (5), 
and switching is necessary in the auxiliary data path so that the correct 
auxiliary data is always presented to the coder (C) with the associated PCM 
signal. As in the arrangement of Fig. 1 , the coder needs to perform re- 
coding independently of the auxiliary signal at times when the path through 
the studio equipment (S) is not transparent. One way of ensuring that this 
happens is for the switch (R) which routes the auxiliary signals the coder to 
suppress all such signals when independent re-coding is necessary. 
Another way would be to add a subsidiary auxiliary data signal to the audio 
passing through the studio equipment (S) which would enable detection of 
non-transparent processing. This might be, for example, a known 
pseudorandom binary sequence (prbs) or some form of cyclic redundancy 
check data on some or ail of the audio data. 

In Fig. 2, the delay (T) required in the auxiliary data path should be 
constant, and may be determined by means of suitable tests. However, 
incoming MPEG audio coded bitstreams in PES form contain PTS, as 
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mentioned previously, and PCM audio signals can carry time Information 
(e.g. the time codes in the ITU-R Rec. 647 signal) which may comprise, or 
be derived from, the Incoming PTS. If the auxiliary signal contains the same 
Information, or the PTS Itself, the Initial setting of the delay CO and the 
subsequent verification of the amount of delay may be performed 
automatically. 

Examples of signals that could comprise the auxiliary data are: 

1. The coded audio signal at the input to the decoder (D1 , D2, etc.). 
This contains not only audio-related data and the PTS but also 
certain auxiliary information such as programme-associated data 
(PAD), which may need to be copied into the coded signal at the 
output from the studio area, and error protection. Depending upon 
the circumstances, such a signal would enable the coder (C) to 
substitute the original coded signal for the re-coded PCM signal, or to 
re-code the PCM signal with blocks of audio data resembling closely 
the blocks within the original coded signal, as described above. 
Conveying the coded audio signal to the coder provides the widest 
range of options for re-codIng with minimal additional Impairment of 
the audio. 

2. The coded audio samples at the input to the decoder minus the 
quantised audio samples (whteh can be re-created identically from 
the PCM audio signal). This is a signal in which the positions of the 
frame boundaries of the original coded signal are Indicated relative to 
the linear audio samples in the PCM signal, and from which the 
positions of the blocks of data within the frames may be deduced, 
together with information on the allocation of bits to the various 
components of the coded signal (sometimes known as "bit-allocation 
data"), scale factors, block lengths (in coding schemes where, this Is 
relevant), the PTS, and any other data relevant to the coding system 
in use. 
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3. A signal similar to that described In '2' above, but containing a subset 
of the information described (e.g. just the positions of the frame 
boundaries). 

Ways in which the auxiliary data signal might be transported with the 
5 PCM audio are: 

1. In the auxiliary sample bits of the ITU-R Rec. 647 bitstream. At the 
studio standard sampling frequency of 48 kHz, a total bit rate of 384 
kbit/s is available in the auxiliary sample bits of both "X" and T" 
subframes. This method is ideal for conveying the auxiliary data 

10 between different items of equipment but there is some uncertainty 

concerning the way in which studio equipment might treat these 
auxiliary sample bits. For example, the studio equipment may not 
route these bits through to the output with the PCM audio, or it may 
not delay these bits by the same amount as the PCM audio. In either 

15 case, some modification of the studio equipment, or of the 

environment around it, may be necessary. 

2. In the least significant bits (1-s.b.) of the PCM audio sample words of 
the ITU-R Rec. 647 bitstream. Depending upon the resolution of the 
studio equipment these may the same as the auxiliary sample bits 

20 (these are the l.s.b if the Rec. 647 signal is configured to carry 24-bit 

audio sample words) or the least significant bits within the part of the 
subfi'ame reserved for 20-blt audio sample words (these are the 
same bits that carry the 20 most significant bits of 24-bit sample 
words). Carrying the auxiliary data in the l.s.b. of the audio sample 

25 words overcomes the problems of routing within the studio equipment 

and care will be taken to ensure that the auxiliary data signal is 
inaudible. The studio equipment needs to be transparent to audio 
sample words of at least 20 bits. If necessary, the audibility of the 
auxiliary data signal could be reduced by scrambling (e.g. by the 
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modulo-2 addition of a pseudorandom binary sequence, or the use 
of a self-synchronising scrambler). Alternatively, it could be removed 
altogether by truncating the audio sample words to the appropriate 
length (I.e. to exclude the auxiliary data). 

In the user data bits of the ITU-R Rec. 647 bitstream. Taking the 
user data bits from both "X" and "Y" subframes provides a channel 
with a bit rate of only 96 kbit^s. In many applications this Is unlikely to 
be sufficient to cany the complete coded audio signal. It would be 
sufficient to signal the positions of frame boundaries, and to cany 
some other Infonnation extracted from the coded audio. With this 
method there Is uncertainty conceming the way in which studio 
equipment might treat the user data. 

In the upper part of the audio spectrum, at frequencies higher than 
those of the audible components of the signal. For this purpose, the 
PCM audio signal would be low-pass filtered, and the coded auxiliary 
data signal added above the passband occupied by the audible 
signal. A particularly ingenious way of doing this, when the studio 
area is receiving MPEG audio coded signals, would be to use an 
MPEG analysis subband filterbank with the reciprocal synthesis 
fllterbank at the insertion units PO In Rg. 1. At 48 kHz sampling 
frequency, the audio passband extends almost up to 24 kHz. In 
MPEG audio coding this passband is divided into 32 equally-spaced 
subbands. each with a bandwidth of 750 Hz. The upper five 
subbands are not used, and the audkj Is thus effectively low-pass 
filtered to 20.25 Khz. The auxiliary data could be Inserted into the 
upper subbands. and wouW be canied In the upper part of the 
spectrum of the PCM audio signal, to be extracted by another MPEG 
analysis filterbank at the splitter (Y) shown In Fig. 1. The PCM signal 
applied to the coder (0) would not need further filtering to remove the 
auxiliary data, as this would happen in the analysis filterbank In the 
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coder itself. 

5. The auxiliary signal might be a low-level known pseudo random 
binary sequence (prbs) added to the audio. The prbs would be 
synchronised in some way with the audio frame boundaries and may 
be modulated with additional data where possible. It Is also possible 
to subtract the prbs from the data prior to final transmission or 
monitoring. 

It has been explained that under certain circumstances it is 
appropriate to perform partial decoding and re-encoding. In the appended 
claims, the terms decoding and re-encoding should be taken as including 
partial decoding and re-encoding, respectively. 
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CLAIMS 



1. A method of audio signal processing, comprising tlie steps of 
receiving a compression encoded audio signal; compression decoding the 
encoded audio signal; deriving an auxiliary data signal; communicating the 
auxiliary data signal with the decoded audio signal and re-encoding the 
decoded audio signal utilising Infbnnatlon from the auxiliary data signal. 

2. A method according to Claim 1, wherein the auxiliary data signal 
comprises essentially the encoded audio signal. 

3. A method according to Claim 1 , wherein the auxiliary data signal 
comprises audio-related data from the encoded audio signal. 

4. A method according to Claim 3, wherein the auxiliary data signal 
comprises time information from the encoded audio signal. 

5. A method according to Claim 4, wherein the auxiliary data signal 
further comprises ancillary Information, such as programme-associated data, 
from the encoded audio signal. 

6. A method of audio signal processing, comprising the steps of 
receiving a compression encoded audio signal; compression decoding the 
encoded audio signal; deriving an auxiliary data signal indicative of the 
analysis and quantisation employed for the encoded audio signal; 
communicating the auxiliary data signal with the decoded audio signal and 
re-encoding the decoded audio signal utilising information from the auxiliary 
data signal such that the re-encoded audio signal employs the same 
analysis and quantisation as the encoded audio signal. 

7. A method according to Claim 6, wherein the analysis comprises 
application of sub-band filter bank. 
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8. A method according to Claim 7, wherein the auxiliary data signal Is 
indicative of the analysis Into sub-bands and the quantisation within each 
sub-band employed for the encoded audio signal. 

9. A method according to any one of the preceding claims, wherein the 
5 encoded audio signal is an MPEG audio coded signal. 

10. A method according to Claim 9, wherein the auxiliary data signal 
contains information relating to one or more of: the position of audio frame 
boundaries in the audio signal; scale factors for the blocks of sub-band 
samples within each audio frame; bit allocation data for the audio frame. 

10 

11. A method according to any one of the preceding claims, wherein the 
auxiliary data signal Is combined with the decoded audio signal for 
communication along the same signal path as the decoded audio signal. 

12. A method according to Claim 1 1 , wherein the auxiliary data signal is 
15 formatted to enable an integrity check prior to use of the auxiliary data signal 

in a re-encoding process, to ensure transparent^communication of the 
auxilisvy data signal along the decoded audio signal path. 

13. A method according to Claim 1 1 , wherein the auxiliary data signal is 
carried in tiie least significant bits of a digital decoded audio signal. 

20 14. A method according to Claim 1 1 , wherein the auxiliary data signal Is 
earned as user data bits in a recognized digital Interface format such as 
ITU-R Rec. 647. 

15. A method according to Claim 1 1 . wherein the auxiliary data signal is 
carried in tiie upper part of the audio spectrum. 
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16. A method according to Claim 15. wherein the auxiliary data signal is 
carried In higher frequencies associated with sub-bands unused In the 
compression encoding. 

17. A method according to Claim 16, In which MPEG audio coding Is 
employed, wherein a filter arrangement analogous to the MPEG analysis 
sub-band filter arrangement and Its reciprocal. Is employed for Insertion of 
the auxiliary data signal into the decoded audio signal. 

18. A method according to any one of Claims 1 to 10, wherein the 
auxiliary data signal is canried in a separate path to the decoded audio 
signal. 

19. A method according to Claim 18, wherein the auxiliary data signal 
path Is disabled In the event of processing In the decoded audio signal 
preventing sensible use of infomiatlon from the auxiliary data signal In re- 
encodlng. 

20. A method according to Claim 19. wherein a tell-tale is added to the 
decoded audio signal to indicate such processing. 
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